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Abstract 


Previous studies in vitro of the processing of cloned polyprotein fragments from the coronavirus infectious 
bronchitis virus (IBV) large open reading frame (ORF1), confirmed the activity of a predicted 3C-like proteinase 
(3CLP) domain and suggested that the proteinase is released autocatalytically from the polyprotein in the form of a 
35 kDa protein, 3CLpro, capable of further cleavages in trans. In order to identify such cleavages within the ORF1 
polyprotein mediated by 3CLpro, the proteinase was expressed in bacteria, purified and used in trans cleavage assays 
with polyprotein fragments lacking the 3CLP domain as targets. The proteinase was expressed as a polyprotein 
fragment which was able to process during expression in bacterial cells, releasing mature 3CLpro. A histidine (His,) 
tag was introduced close to the C-terminus of the proteinase to aid purification. Processing demonstrated by the 
tagged proteinase was indistinguishable from that of the wild-type enzyme indicating that the site chosen for the tag 
was permissive. From these studies we were able to demonstrate trans cleavages consistent with the use of most of 
the previously predicted or identified sites within the open reading frame of gene 1. This tentatively completes the 
processing map for the ORF1 region with respect to 3CLpro. © 1999 Elsevier Science B.V. All rights reserved. 
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1. Introduction tural proteins within two large open reading 
frames, ORFs la and 1b, situated at the 5’ end 

The coronavirus infectious bronchitis virus and occupying the major portion of the 27.6 kb 
(IBV) encodes the major portion of its non-struc- single-stranded positive-sense RNA genome 


(Brown and Brierley, 1995). The ORFs overlap 
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lation products. The large size of the ORFs sug- 
gests that the translation products are 
polyproteins that are processed to generate a set 
of mature non-structural proteins. In common 
with all other coronaviruses so far examined, IBV 
encodes a 3C-like proteinase domain which is 
located towards the C-terminus of the ORFla 
polyprotein and approximately centrally in the 
ORF l1a/1b fusion product. The 3CLP domain was 
first identified as a possible component of IBV 
polyproteins (Gorbalenya et al., 1989) due to 
sequence similarity to picornavirus 3C proteinases 
(reviewed in Ryan and Flint, 1997). In the same 
study, predictions were made as to likely target 
dipeptide cleavage sites for a 3C-like proteinase 
within the ORF1 polyproteins, the large number 
suggested implicating this proteinase in a pivotal 
role in the maturation of the polyproteins. Similar 
domains were subsequently identified in mouse 
hepatitis virus (MHV) (Lee et al., 1991) and hu- 
man coronavirus (HCV 229E) (Herold et al., 
1993). There are no cellular counterparts for the 
activities of the 3C-like proteinases, therefore they 
are essential to the successful maturation of their 
respective polyproteins and subsequent replication 
events. This, and the comparatively high degree of 
conservation (at functional level), makes the 3C- 
like proteinases attractive targets for antiviral 
intervention. 

Given the essential role played by the 3C-like 
proteinase of the coronaviruses in maturation of 
the ‘polymerase’ polyproteins, there has been con- 
siderable interest in characterising the enzyme, 
with initial emphasis on defining the processing 
events involving the proteinase. Such studies are 
already at an advanced stage for both IBV (Liu et 
al., 1994, 1998) and HCV 229E ORF1b regions 
(Grotzinger et al., 1996; Heusipp et al., 1997a,b). 
3C-like proteinase activities have been confirmed 
recently for three representatives of the coro- 
navirus group, MHV (Lu et al., 1995), HCV 229E 
(Ziebuhr et al., 1995) and IBV (Tibbles et al., 
1996) and the protein has been identified in vivo 
during viral infection in the case of HCV (Ziebuhr 
et al., 1995) and MHV (Lu et al., 1996). In 
addition, the proteinases from these two viruses 
have been purified and partially characterised 
(Seybert et al., 1997; Ziebuhr et al., 1997). The 


identification of the IBV 3CLpro was first realised 
in vitro when a cloned fragment of the 
polyprotein was expressed in reticulocyte lysate 
(Tibbles et al., 1996). In the presence of mem- 
branes, upon which activity was vitally dependent, 
processing of the region containing the protease 
domain was observed with the release of a 35 kDa 
protein (p35). This product was capable of trans 
processing, but was inefficient and of limited use 
for mapping studies. We undertook the identifica- 
tion of the remaining IBV 3CLpro cleavage 
targets within the ORF1 polyprotein using bacte- 
rial expression to obtain significant quantities of 
3CLpro. 


2. Expression and purification of 3CLproH6 


The bacterial expression system described 
utilises the minimum processing unit identified in 
our previous studies which established that, in 
addition to the 3CLP domain, MP2 protein se- 
quence between the Q/S, cleavage target and a 
point delimited by an Ncol restriction site (ntd 
position 10118) was that minimally required for 
processing. A BamHI fragment released from 
pPKT205 (see below) was introduced into BamHI 
digested pET3xa (Novagen) to yield pET205 (Fig. 
1). This construct would allow expression to be 
monitored through the synthesis of the strongly 
expressed T7 gene 10 protein, which composes the 
major insoluble fraction component upon extrac- 
tion. Expression beyond the NcoI site was re- 
stricted by digestion of pET205 with NcoI 
followed by end repair and religation. This ma- 
nipulation introduces a frame-shift in the se- 
quence which results in a translation stop signal 
being encountered. This plasmid was then used to 
create an internal deletion between the vector 
initiator ATG triplet and the IBV sequence by 
digestion with NheI and BamHI. The newly gen- 
erated plasmid, pETAN205ACN°, was modified 
to include a His-6 tag approximately 35 amino- 
acid residues from the C-terminus of 3CLpro to 
aid in purification. A mutation which replaced the 
amino-acid sequence VKNSQW (single letter 
code) with six histidine residues near the C-termi- 
nus of the 3CLP domain in pKT205 (ntds 9679- 
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Fig. 1. Schematic representation of the IBV ORF la/1b polyprotein with positions of predicted cleavage dipeptide targets shown as 
solid vertical bars, those with arrowheads have been experimentally confirmed. Predicted catalytic domains are also indicated 
(MP1/2; membrane proteins, 3CLP; 3C-like proteinase, POL; polymerase, Zn; zinc finger, HEL; helicase). The derivation of the 
eukaryotic and bacterial expression clones and the principal cloning vectors used are depicted. The principal restriction sites used 
in cloning manipulations (adjacent solid bars), promoters (T7, SP6 and Ptac) and translation (Tln) start sites for the plasmids are 
also shown. 
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9696) was made, giving rise to pK T205His. The 
positioning of this tag sequence was determined 
by consideration of the possible amino-acid se- 
quences involved in cleavage at the Q/S, site and 
potential for interaction with the affinity matrix 
based on predicted hydrophilicity (Kyte and 
Doolittle, 1982) of the proteinase sequence. This 
modified sequence was transferred to the 205- 
bearing pET plasmid via replacement of a Nsil 
fragment. The processing properties of the 
modified proteinase were tested extensively, using 
a variety of systems, and were found to be indis- 
tinguishable from wild-type (data not shown). 
Therefore the appropriate pET plasmids (Studier 
et al., 1990) were introduced into E.coli BL21 
pLyssS cells (Studier, 1991). Cultures were grown 
for approximately 3 h until the OD¢go9. was be- 
tween 0.6 and 0.9 and expression induced by the 
introduction of IPTG (at 100 ug/ml) into the 
growth medium. The cells were harvested after a 
further 3 h. Cells were lysed, clarified and the 
supernatant applied to a chelating agarose column 
charged with cobalt chloride. The use of cobalt 
instead of the usual nickel resulted in lower bind- 
ing of background material (data not shown). 
Alternatively, cultures were induced after 1.5—2 h 


A 


(OD600 0.3 approx.) and rifampicin was added (to 
200 pg/ml) after a further hour and incubation 
continued for 3 h. This resulted in a substantial 
inhibition of cellular protein expression without 
affecting ongoing T7-dependent expression. Al- 
though this resulted in a lower overall protein 
yield, there was a higher proportion of 3CLpro 
and background binding for the metal chelate 
column selection was significantly reduced. Typi- 
cal results are shown in Fig. 2 where the appear- 
ance of a novel 35 kDa protein was observed after 
induction (compare lanes | and 2, panel A). Clar- 
ification of the crude lysate (lane 3) by centrifuga- 
tion revealed that the major portion of the protein 
of interest (i.e. 35 kDa) remained in the superna- 
tant (lane 5, compare to lane 4, pellet fraction). 
Comparison of the total applied protein and the 
unbound fraction (lanes 5 and 6 respectively) 
revealed that over 90% of the novel band seen 
after induction was retained on the column. This 
putative 3CLproH6 bound with relatively low 
affinity and was seen to elute at 60mM imidazole 
(lane 8), conditions normally associated with re- 
moving background binding. No observable 
protein remained to be eluted with 1M imidazole 
(lane 9). The background contaminating proteins 


Fig. 2. Expression and purification of 3CLproH6. A. Samples are total lysates from noninduced (lane 1) and IPTG-induced (lane 
2) cultures of E.coli BL21 cells transformed with pETAN205ACN°°His or fractions made during purification. These are the total 
lysate (lane 3), pellet (lane 4) and clarified supernatant (lane 5) before application to the cobalt column, the unbound fraction (lane 
6) and eluted fractions at 5 mM (lane 7), 60 mM (lane 8) and 1M (lane 9) imidazole. B. Comparison of the eluted proteins during 
normal induction (lane 1, equivalent to lane 8, panel A) and during induction in the presence of rifampicin (lane 2). All samples were 
normalised before separation on 17.5% polyacrylamide gels which were stained with Coomassie brilliant blue. The position of the 


induced 35 kDa product is indicated by open arrowheads. 
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could be significantly reduced by expression in the 
presence of rifampicin. While this resulted in a 
reduction in the total protein yield of some 80%, 
the proportion that was 3CLproH6 was greater 
and resulted in less background in the purified 
product (panel B, compare products under nor- 
mal conditions, lane 1, with those obtained in the 
presence of rifampicin, lane 2, after single step 
purification on a cobalt column). Only fractions 
seen to contain the putative 3CLproH6 were 
found to cleave target polypeptide targets in a 
trans assay which supports the identity of the 
induced product as 3CLproH6 (data not shown). 
From comparison with known concentration 
protein standards (BSA) we estimated the yield to 
be approximately 2—4 mg/l of culture (0.5—1 mg/l 
in the presence of rifampicin), which compares 
favourably with that obtained by Ziebuhr et al., 
1997, 


3. Identification of possible 3CLpro target sites 


Different regions of the ORFla/1b polyprotein 
were expressed in reticulocyte lysate (TnT, 
Promega) or bacterial cell extract (S30, Promega) 
in the presence of [*°S] methionine using the fol- 
lowing expression plasmids (see Fig. 1). Construc- 
tion of the pKT205 and pKTBP5 plasmids 
(encoding IBV sequence from nucleotides (ntd) 
8693-10927 and 10752-12600 respectively) has 
been described elsewhere (Liu et al., 1994). These 
encode predicted cleavage sites Q/S; to Q/G, and 
Q/S; to Q/So, respectively, within the C-terminal 
region of ORFla. IBV sequence between ntd 
16785 and 23060 (Boursnell et al., 1987), am- 
plified by RT-PCR using a first-strand cDNA 
synthesis kit (Pharmacia) and Pfu polymerase 
(Stratagene), was cloned into pCRScript (Strata- 
gene). This plasmid, pCRF4, was digested with 
Esp! and PstI (following end repair of EspI site 
with DNA polymerase) and the fragment intro- 
duced into EcoRV-—PstI digested pKT0/NS1 vec- 
tor (Tibbles et al., 1995) to produce pKT1bC 
(encoding nucleotides 16841-20926). The 
polyprotein fragment encoded contains the three 
predicted 3CLpro cleavage sites Q/G,,, Q/S,> and 
Q/S,; from the C-terminal region of ORF 1b. 


Cleavage dipeptides were mutated (to G/S or A/S) 
by oligonucleotide-directed mutagenesis as de- 
scribed previously (Tibbles et al., 1996). This re- 
gion was transferred as a Spel fragment to 
pGEX4T2 (Pharmacia) and deletions made to 
express portions containing each target dipeptide 
individually. The N-terminal half of the ORFla 
region (up to nucleotide position 6500 approx.) is 
encoded by the plasmid pKT1a3, constructed as 
described previously (Liu et al., 1995). Analysis of 
the Q/G; site was made using a KpnI deletion of 
the pET205 plasmid, pET205A*"". A Hincll frag- 
ment (ntd 10438-10902) bearing the Q/G, target 
from pKT205 was introduced into pGEX4T2 to 
yield pGEXMP2C (resulting in the production of 
a fusion with glutathione-S-transferrase (GST), 
Smith and Johnson, 1988). The region encoding 
the MP1 region and Q/G, was derived from PCR 
amplification and a fragment (ntds 6919-8693) 
cloned upstream of the 205 sequence in pKT205/ 
BPS. 

Purified proteinase was added in varying 
amounts to the radiolabelled target fragments and 
incubated for up to 16 h at room temperature. 
Cleavage patterns were analysed following poly- 
acrylamide gel electrophoresis and an example of 
these analyses is shown in Fig. 3. Complete diges- 
tion conditions for the fragment corresponding to 
the BP5 region of the polyprotein revealed a 
cleavage pattern that was consistent with the pat- 
tern expected from the cleavage predictions made 
by Gorbalenya et al., 1989. This target contained 
the predicted cleavage sites of Q/S,, Q/S, and 
Q/So, all of which, along with an additional un- 
predicted Q/N site, have been experimentally 
confirmed (Liu et al., 1997; Ng and Liu, 1998). 
Cleavage within BPS would be expected to yield 
processed products of approximate sizes of 6, 24, 
10, 17 and 3 kDa (as they lie on the polyprotein) 
assuming cleavage at Q/S,, Q/N, Q/S, and Q/So. 
In Fig. 3 the effect of processing by the addition 
of 3CLproh6 can be seen. The uncleaved target 
(lane 1) consisted of two species (of approximately 
60 and 70 kDa) because the sequence contained 
the frame-shift signal, followed by approximately 
300 nucleotides from ORF 1b. Only the two 
largest cleavage products were resolved in this 
experiment (lane 2) which correspond to cleavage 
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Fig. 3. Trans processing activity of 3CLproH6 against Q/S, 
target dipeptide. Samples of in vitro translations of the pK- 
TBP5-derived transcript in the presence of [°°S] methionine 
were incubated in the absence (lane 1) or presence (lane 2) of 
1 wg of purified 3CLproH6 for 16 h. Processing of [*°S] 
methionine-labelled mutated BP5 target is shown in lane 3 and 
the control incubation in lane 4. Separation was by elec- 
trophoresis on 17.5% polyacrylamide gels and products de- 
tected by autoradiography. The positions of markers are 
indicated at left and processed products indicated by arrows to 
the right. 


at Q/S, and Q/N for the largest product of 24 
kDa and at Q/S, and Q/S, for the smaller of 17 
kDa. The identities of these two products were 


Table 1 


reinforced by comparing them with those pro- 
duced after incorporation of [*°S] cysteine. The 
relative intensity of the two bands is reversed 
reflecting the inverse ratio of the two amino acids 
in the 24 and 17 kDa products (data not shown). 
No cleavage was apparent in the absence of 
3CLproH6 (data not shown). Cleavage at the 
predicted Q/S, site was tested by mutation of the 
cleavage dipeptide to an uncleavable form. Fail- 
ure to cleave at the mutated Q/S, site should 
result in the extension of the N-terminus of the 24 
kDa product by 6 kDa resulting in the generation 
of a new cleavage product of approximately 30 
kDa. Comparison of the cleaved wild-type 24 kDa 
product (lane 2) and the product from the mu- 
tated polyprotein fragment (lane 3) revealed a 
decrease in the mobility of the more intensively 
labelled product (equating to approximately 30 
kDa), consistent with failure to cleave at the 
mutated Q/S, site. 

Polyprotein fragments constituting the remain- 
ing areas of the ORFI1 region were synthesised 
and similarly analysed for cleavage by purified 
3CLproH6. The results of all of these trans assay 
analyses are summarised in Table 1. We found 
that all so far identified target dipeptides are 
utilised by the 3CLproH6 enzyme in vitro. It was 


Processing targets identified within the ORFla/b polyprotein of IBV 


Clone Nucleotides Expression sys- Predicted site Trans cleavage in Dipeptide muta- Identification in 
tem vitro tion vivo 
pKT1la3 364-6495 RRL* None None Irrelevant No 
pKT3’la 6919-8448 RRL* Q/G, Not cleaved Not done No 
pKT205 8693-10438 RRL‘ Q/S, Cleaved Not cleaved No 
Q/S, Cleaved Not cleaved No 
pETMP2N 10118-10438  $30¢° Q/G; Not cleaved Not done No 
pGEXMP2C 10438-10902 = S30° Q/Ge Not cleaved Not done No 
pKTBP5 10752-12600 RRL* Q/S, Cleaved Not cleaved Yes 
Q/N* Cleaved? Not done Yes 
Q/Sg Cleaved? Not done Yes 
Q/So Cleaved® Not done Yes 
pKT220/249 12625-15500 RRL* Q/Sio9 Cleaved Not done Yes 
pGEX1bl1 16854-17591 — $30° Q/Gi, Cleaved Not cleaved Yes 
pGEX1b12 17591-19154 $30¢° Q/Si5 Cleaved Not cleaved Yes 
pGEX1b13 19154-20389 $30° Q/Si3 Cleaved Not cleaved Yes 


* Not previously predicted. 
> Inferred from cleavage patterns only. 
°RRL = rabbit reticulocyte lysate, S30 = bacterial lysate. 
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noted that under conditions expected to result in 
complete cleavage, not all target sites were di- 
gested to completion. These tended to be in 
ORF 1b. Only three of the predicted sites fail to be 
cleaved, namely Q/G,, Q/G; and Q/G, and no 
novel target sites were detected. The first of the 
uncleaved sites (Q/G,) lies upstream of both the 
3CLP and MP1 domains while the latter pair fall 
within the MP2 region. Either these sites are not 
utilised by 3CLpro or the conformation of the 
target fragments bearing them in our assays do 
not facilitate their recognition by the proteinase. 
However in the case of the Q/G,-bearing GST 
fusion, cleavage consistent with use of the 
thrombin site within the fusion junction region 
was observed (data not shown), which suggests 
that no gross conformational abnormality exists 
that might preclude proteolysis. 


4. Conclusions 


We have used bacterial expression of viral 
protein, comprising the IBV ORF la polyprotein 
fragment that gives rise to 3CLpro via autocata- 
lytic processing in vitro (Tibbles et al., 1996), to 
obtain significant amounts of highly purified ac- 
tive proteinase for trans processing studies. Our 
rationale for modifying the starting pET plasmid 
containing the 205 sequence was as follows. Ex- 
pression of hydrophobic proteins from _picor- 
naviruses with deleterious effects on bacterial cells 
has been reported (Lama and Carrasco, 1996). 
Therefore termination of expression at the NcoI 
site within MP2 would still allow processing, ac- 
cording to our in vitro data (Tibbles et al., 1996), 
while at the same time minimising the possible 
toxicity of the highly hydrophobic MP2 region. 
The gene 10 sequence was deleted in order to 
promote the production of a soluble polyprotein 
fragment, equivalent to that which is able to 
mature in vitro, and so ensure processing would 
proceed. Evidence suggests that in bacteria fold- 
ing is post-translational (Netzer and Hartl, 1997) 
and so there was potential for the proteinase to 
become inactive due to the characteristic seques- 
tration of pET fusion proteins into inclusion bod- 
ies. In the event this latter measure appeared 


unnecessary as soluble processed 3CLpro was 
produced by the cells expressing fusion proteins 
(data not shown). This suggests that folding and 
processing were completed if not before comple- 
tion of translation then before the sequestration 
into inclusion bodies. Thus by whatever means 
the proteinase matures in its usual eukaryotic 
context seems to be facilitated in the bacterial cell, 
though not necessarily by analogous interactions. 
The precise nature of the interaction between 
MP2 and cellular components (membranes) neces- 
sary for maturation of 3CLpro and its role in the 
mechanism of 3CLpro processing (maturation or 
release) remain to be determined. We do not 
know as yet whether the membrane requirement 
for maturation of 3CLpro also reflects a need for 
other associated cellular components such as 
chaperone functions. The fact that processing is 
able to proceed in an heterologous environment 
offers a possible means of investigating this 
further. 

In order to simplify purification of 3CLpro, we 
introduced a histidine (His,) tag which facilitates 
separation on a suitable metal chelation matrix. 
The tag could not be placed at the extreme N or 
C-termini of the protein as is usual since the 
cleavage dipeptide and surrounding sequences are 
crucial for correct processing (Ryan and Flint, 
1997). We introduced the tag into the nearest 
region of predicted hydrophilicity to the C-termi- 
nal processing site Q/S4 so as to minimise inter- 
ference with the sequences involved in processing 
and allow interaction of the exposed histidine 
residues with the purification matrix. The tagged 
proteinase (3CLproH6) appears to be indistin- 
guishable from wild-type and is readily purified to 
high purity. However, the affinity between 
modified proteinase and matrix is low and similar 
to background proteins suggesting that the tag 
may not be fully accessible. Our observation 
seems to conflict with the interpretation of a 
mutational analysis of HCV 229E 3CLpro by 
Ziebuhr et al., 1997 which concluded that the 
C-terminus of 3CLpro was important to function 
and intolerant of mutation. The His,-tag muta- 
tion here (a substitution) was well tolerated and 
would therefore appear to indicate a so called 
‘permissive site’ in the C-terminus of the protein. 
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We have shown by trans cleavage analysis that 
processing occurs at the majority of the predicted 
3CLpro cleavage dipeptides within the IBV ORF1 
polyprotein and that no further cleavages are 
apparent. Our data reinforce recently reported in 
vivo studies and also partially overcome a poten- 
tial shortcoming of these approaches where some 
products may not be consistently detectable, due 
to poor reactivity with antisera or short half life 
for example. This analysis tentatively completes 
the processing map of the ORFI region with 
respect to 3CLpro. There appear to be no target 
sites for 3CLpro in the N-terminal half of the 
ORF la polyprotein (covered by the 1a3 clone) so 
that processing of this region and possibly regions 
upstream is probably achieved through the PLP 
domain (Liu et al., 1995), alone or in combination 
with other as yet unidentified proteinase activities. 
Those target dipeptides that were not cleaved in 
our assays are associated with the hydrophobic 
domains. Cleavage of the MPI region down- 
stream of 3CLpro may be carried out by the PLP 
domain or an alternative activity, although failure 
for 3CLpro to cleave Q/G, may reflect a short- 
coming in the presentation of the targets. The 
data presented here also suggest that there is no 
processing between Q/S, (the C-terminus of the 
3CLP domain) and Q/S, so that the MP2 product 
containing uncleaved Q/G; and Q/G, dipeptides 
would be expected to be approximately 42 kDa. 
Notably these three hydrophobic region targets 
are not represented in either MHV or HCV 229E, 
although, along with IBV itself, these viruses bear 
a potential target at a position equivalent to just 
downstream of the IBV Q/G, site. These possibil- 
ities and the assembly of processed products into 
polymerase complexes may be addressed using 
specific antisera raised to amino or carboxyl ter- 
minal regions of processing products indicated by 
the present studies. 
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